Picture for K R Prajwal

K R Prajwal

Recognizing Co-Speech Gestures in-the-Wild

Add code
May 29, 2026
Viaarxiv icon

Understanding Co-speech Gestures in-the-wild

Add code
Mar 28, 2025
Figure 1 for Understanding Co-speech Gestures in-the-wild
Figure 2 for Understanding Co-speech Gestures in-the-wild
Figure 3 for Understanding Co-speech Gestures in-the-wild
Figure 4 for Understanding Co-speech Gestures in-the-wild
Viaarxiv icon

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

Add code
Oct 27, 2024
Figure 1 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 2 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 3 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 4 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Viaarxiv icon

A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision

Add code
May 16, 2024
Figure 1 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Figure 2 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Figure 3 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Figure 4 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Viaarxiv icon

Weakly-supervised Fingerspelling Recognition in British Sign Language Videos

Add code
Nov 16, 2022
Figure 1 for Weakly-supervised Fingerspelling Recognition in British Sign Language Videos
Figure 2 for Weakly-supervised Fingerspelling Recognition in British Sign Language Videos
Figure 3 for Weakly-supervised Fingerspelling Recognition in British Sign Language Videos
Figure 4 for Weakly-supervised Fingerspelling Recognition in British Sign Language Videos
Viaarxiv icon

Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild

Add code
Sep 01, 2022
Figure 1 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Figure 2 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Figure 3 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Figure 4 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Viaarxiv icon

Automatic dense annotation of large-vocabulary sign language videos

Add code
Aug 04, 2022
Figure 1 for Automatic dense annotation of large-vocabulary sign language videos
Figure 2 for Automatic dense annotation of large-vocabulary sign language videos
Figure 3 for Automatic dense annotation of large-vocabulary sign language videos
Figure 4 for Automatic dense annotation of large-vocabulary sign language videos
Viaarxiv icon

Visual Keyword Spotting with Attention

Add code
Oct 29, 2021
Figure 1 for Visual Keyword Spotting with Attention
Figure 2 for Visual Keyword Spotting with Attention
Figure 3 for Visual Keyword Spotting with Attention
Figure 4 for Visual Keyword Spotting with Attention
Viaarxiv icon

Visual Speech Enhancement Without A Real Visual Stream

Add code
Dec 20, 2020
Figure 1 for Visual Speech Enhancement Without A Real Visual Stream
Figure 2 for Visual Speech Enhancement Without A Real Visual Stream
Figure 3 for Visual Speech Enhancement Without A Real Visual Stream
Figure 4 for Visual Speech Enhancement Without A Real Visual Stream
Viaarxiv icon

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

Add code
Aug 23, 2020
Figure 1 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Figure 2 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Figure 3 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Figure 4 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Viaarxiv icon